A proof of Schrodinger's equation 

Howard Covington 
Isaac Newton Institute for Mathematical Sciences, Cambridge, England 

(Dated: May 20, 2010) 

Abstract 

Schrodinger's equation for a single particle is proved from the assumption that dynamics can be 
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. formulated in a space whose curvature is the electromagnetic force. 
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Einstein's general theory of relativity brought to physics the idea that the gravitational 
force is geometric in origin and may be identified with the curvature of spacetime. The 
mathematics needed to apply more generally the idea that force can be identified with 
curvature is the theory of vector bundles. This is a part of a theory that was completed 
by mathematicians in the 1950s. By that time, physicists had already discovered its main 
ideas for themselves as gauge theory^. It is nonetheless worthwhile to seek to express the 
dynamics of particle in a space whose curvature can be identified with the electromagnetic 
force, since this leads to a proof of Schrodinger's equation and so to elementary quantum 
mechanics. 

General relativity describes curvature by attaching to each point in spacetime a vector 
space called the tangent space at that point. Differential relationships between vectors in 
neighbouring tangent spaces are defined by the Christoffel symbols r "^(i). A covariant 
derivative acts on a vector field v a (x) according to 

D/ = 3/ + r;/ (i) 

The covariant derivative may be used to define parallel transport of a vector. Curvature is 
the change in a vector when it is parallel-transported around an infinitesimal closed loop. In 
the theory of vector bundles^ a space S is created by attaching to each point in spacetime a 
vector space that can carry the representation of a group. Differential relationships between 
vectors in neighbouring vector spaces are defined by a connection A a b (x). If the group is 
the group of rotations around a unit circle, then the vector space is equivalent to the space 
of complex numbers. The simplest object that can inhabit S in this case is a complex- valued 
function ip(x). The connection now becomes wholly imaginary and the covariant derivative 
acts on ip according to 

= + i-yApip, (2) 

where is real and 7 is a real number introduced for convenience. The curvature, F^ u , of S 
is given by F^ u = d^A v — d v A^. It is invariant under group operations, which in this context 
are called gauge transformations, and satisfies the second Maxwell equation e^^dpF^ = 0. 
This establishes that the Faraday tensor is the curvature of a space S whose vector space is 
the space of complex numbers. 

The question is now how to define the dynamics of a test particle in this space. Parallel 
transport in S is defined in relation to the vector space of complex numbers rather than in 
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relation to the tangent space of a curve so, in contrast to general relativity, cannot be 
used to define a class of curves that parallel transport their tangent vectors. So geodesies 
are not defined by and there is no natural equation for a test particle trajectory. We 
therefore have to look elsewhere for a formulation of dynamics. To this end we seek a class 
of functions if> that have the dynamical properties of classical particles. 

Suppose if) is given by a differential equation. A^ defines differential relations, so it will 
appear in this equation, if> will therefore appear to interact with an electromagnetic field, 
so it must have a charge and will be a source for A^. The evolution of if) and A^ will 
therefore be coupled together. Since the electromagnetic field may be specified by a least 
action problem 3 , if) and A^ can be coupled together consistently if if) is also specified by 
a least action problem. The problem of defining if) is therefore equivalent to defining an 
appropriate Lagrangian for if). The formalism of vector bundles now comes into its own, 
because the Lagrangian for if) in the presence of the connection A^ can be obtained from 
the free Lagrangian C for the free field if> when the connection is zero by the substitution of 

by given by (J2J). This is just the minimal coupling prescription from quantum field 
theory appearing here in geometric form. 

Since if) has charge it must represent charged matter. Accordingly, it should be possible 
to construct a specific if> that represents a classical charged particle. C will then determine 
a function that represents a free particle. The dynamics of a free particle are specified by its 
4-momentum, P M , its angular momentum and its position. Classically, the 4-momentum is 
constant, unique, has a positive energy and is otherwise constrained only by the requirement 
from the special theory of relativity that P 2 = m 2 c 2 . We therefore seek a free Lagrangian 
that enables if) to incorporate these properties, insofar as it is possible to do so. 

To make progress we use Noether's theorem 4 . Subject to appropriate boundary condi- 
tions, this theorem associates with each symmetry of a least action problem a conserved 
current satisfying d^j^ = and a constant of the dynamics equal to J j°d 3 x. When 

the free electromagnetic field is specified by a least action problem, the four constants de- 
fined by Noether's theorem that arise from invariance under spacetime translations are equal 
to the components of its 4-momentum^. Similarly, the six constants that arise from invari- 
ance under rotations and Lorentz transformations are the components of the field's angular 
momentum tensor. 

The electromagnetic field and the function if) are going to be coupled together in a single 
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least action problem, so for consistency, ^'s momentum and angular momentum must be 
defined by applying Noether's theorem to the least action problem for ijj. For convenience 
we refer to iff's momentum defined in this way as its Noether momentum. 

We now have enough substantially to determine £. £ is a functional of ip and its deriva- 
tives. For the least action problem to be well defined, £ must be real. For i/j to have 
dynamical behaviour £ must contain at least ip and its first derivatives. So that the least 
action problem is formulated covariantly and is independent of gauge transformations, £ 
must be a Lorentz- and gauge-invariant scalar. For £ to be gauge-invariant each term in 
£ must be at least a covariant bilinear combination of ip* (or a derivative of ip*) and ip (or 
a derivative of if) ) or a higher order gauge-invariant combination. So that £ captures the 
dynamics of a free particle, it must include the particle's mass m. 

The simplest Lagrangian that satisfies these requirements consists of covariant bilinear 
combinations of a scalar function ip and its complex conjugate ip* and their derivatives up 
to first order. It therefore contains two arbitrary real constants, denoted for convenience h 
and fi 2 , and may be expressed as 



The Euler-Lagrange equation for the least action problem is the Klein-Gordon equation 
(d 2 + fi 2 )ip = 0. The general solution contains terms that would have both positive and 
negative energy^. To avoid the negative energy solutions we choose to consider only solutions 
of the form 



When we examine how tp interacts we will want to expand it as a sum over plane wave 
states. So for analysing interactions, this choice limits the theory we are constructing to the 
low energy approximation where the functions e are complete on a surface of constant 
time. 

The value for 7 in fl2]) can now be fixed in terms of ip's charge. Since £ is gauge invariant, 
it is invariant under the phase symmetry ip — >■ e~ iX i/j, where A is a real number. By Noether's 
theorem this phase symmetry gives rise to a vector j M satisfying c^j' M = where^ 



£ = h (d^d^ip - /iVV) • 



(3) 




(4) 




(5) 
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In the presence of a connection, <9 M is replaced by in C. Denoting the resulting Lagrangian 
by Cd, it follows from fl2]) that 
dC D 

dA, A=0 ~ 1% y d(d,r) d(d^) 
In the presence of an electromagnetic field, the complete Lagrangian for the least action 
problem is the sum of the electromagnetic Lagrangian, —^F fJiU F^ 1 ', and Cd- When is 
varied to determine the least action, the electromagnetic Lagrangian will contribute d u F u ^ 
and Cd will contribute ^p. Inserting appropriate constants^, the condition for the action 
to be minimized for variations in A^ will be 

L dyF ^ = _ c ^£ (7 ) 
Ho dA^ 

By the first Maxwell equation, d v F vp " = fioJ 11 , the right hand side of ([7]) must be identified 
with the electromagnetic current density associated with ip and the zeroth component with 
^'s charge density. So from ([6]), C7J M can be identified with the electromagnetic current 
density for the free field ip. Integrating cyj° establishes that the charge q of the free field ip 
satisfies q — 7 J j°d 3 x. 

The problem with this is that J j°d 3 x is different for each function ip given by the same 
C, so 7 is not constant and the covariant derivative is not properly defined. We can solve 
this problem in the following way. Using the expression (jlj) for ip in t^j we find that 
/ j°d 3 x = hj d 3 k\a(k)\ 2 . Since the Klein-Gordon equation is linear, if ip is a solution then 
so is cip, where a is any complex number, and for this solution J j°d 3 x is multiplied by \a\ 2 . 
We now agree to use a particular value of a to represent all ips that are generated by varying 
a. By choosing this value to be a = (^f d 3 k\a(k)\ 2 ^j , we arrange that J j°d 3 x = h, 
ensuring that the covariant derivative is independent of ip. As a consequence we have 7 = I 
and the covariant derivative defined in (T5]) becomes 

D li = d ll + i^A ll . (8) 

Defining normalized coefficients a{k) by a{k) = aa(k), -ip takes the standard form 

Having determined 7, we now turn to ^'s momentum. From the translation invariance of 
the least action problem, ip's Noether momentum, P M , is given^ by 

p " = / ,ex (sm^' + wwf r ~ g °' c ) ■ (10) 



To express P M more compactly we use (j3j) and the fact that £, is bilinear to introduce a 
symbol (tp\0\ip), where O is a linear operator, defined by 

(if)\0\if)) = ^Jd 3 x 

We denote ( , 0|/|V') by where 7 is the identity operator, so that = 1. It then 

follows^ that 

P^ = (12) 
With ^ given by (jHj), then using (JTU1) or (|T2|) we have 

P» = h J d 3 k\a(k)\ 2 k». (13) 

P° is positive provided ft and fc° are both positive. If £ is replaced by p£, where p is any 
real number, then the Klein-Gordon equation, and therefore if), are unchanged but P M is 
multiplied by p. If if) is to represent a free particle then its Noether momentum must be 
unique. We must therefore eliminate the freedom to vary p in this way. To arrange this 
we choose a specific C to represent the different £'s that are generated by varying p by 
agreeing a unique positive value (as yet unknown) for h. If if> is a plane wave of the form 
if> = a(k)e~ thx then its momentum is just P M = hh^, which is the de Broglie relation. 

Just as Noether momentum is defined in relationship to the translational invariance 
of the least action problem, we can define Noether angular momentum from its rotational 
invariance. Using the symbol {if)\0\if>) defined in ( ITTi) . the result is a 3- vector J of constants^, 
where J = (if>\x x (—iftV)\ip) . If we assume that if> is localized, then it is also meaningful to 
define a measure (x) of the position of if) by (x) = (ip\x\ip). 

We now turn to the question of how to represent a particle in the theory. If if> is a plane 
wave, the momentum condition P 2 = m 2 c 2 can be satisfied with P M given by P^ = hk^ 
provided that satisfies h 2 k 2 = m 2 c 2 . This can be arranged by choosing the parameter 
p in the Klein-Gordon equation to be equal to mc/H. If if> is the linear combination (Q, 
the momentum condition will not then be satisfied exactly but will still hold approximately 
provided the coefficients a(k) are only non-zero in a narrow range Sk of values for k where 
\5k\ \k\. Provided that h is sufficiently small, if) can, in principle, be constructed so that 
it is both localized on a small scale and its momentum satisfies the momentum condition to 
a good approximation. 
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(11) 



In this way we are led to the proposal that a localized function ■0, or wave packet, is a 
possible representation of a free particle in this theory. From the point of view of the classical 
dynamics we started from, this is not an ideal solution since it does not satisfy exactly the 
requirements we set. But it is the best we can do. For the moment we assume that ip is 
always localized on a scale much smaller than the laboratory scale for making observations 
of particles so that we cannot observe the internal structure of a wave packet. 

To ensure that the momentum condition is satisfied we have relied on the form of the 
Klein-Gordon equation. If higher order derivatives had been included in £ this would not 
have been achieved or, at least, not in a simple way. If £ contained terms that were higher 
order than bilinear in ip* and then the field equations would not be linear. The Noether 
momentum would then not be proportional to an unconstrained wave vector k^. For these 
reasons we conclude that £ must be bilinear in ip* and ip and contain derivatives of first, 
but no higher, order. A scalar function ip is only the simplest object that can inhabit S. 
A complex function with several components could equally well be defined on S. So that 
a ip with several components may be combined into a Lorentz-invariant Lagrangian the 
components must be drawn from a particular representation of the Lorentz group. There 
will therefore be a class of relevant Lagrangians, whose members correspond to different 
irreducible representation, all of which are bilinear and contain only first order derivatives. 

For each Lagrangian in this class the arguments leading through to equation f lT3|) will 
be unchanged in character save that ip, a(k) and a(k) acquire an index, J, that labels 
components and the form of the Lagrangian will be different for each representation. Since 
the constant J j°d 3 x and the Noether momentum are constructed covariantly, they will 
both depend on the amplitude aj{k) only through the combination |a(A?)| 2 = Tiia}(k)ai(k), 
so the reasoning leading to ([8]) will be the same. Just as fi has to be independent of ip 
for the covariant derivative to be properly defined, so h must also be independent of the 
representation from which ip is drawn and we must always have 7 = | in the covariant 
derivative. Lagrangians for different representations must include appropriate numerical 
factors to give this result, h will then be the same for all representations and becomes a 
structural constant of the theory. We consider later the significance of ip having several 
components. 

We now have everything we need to prove Schrodinger's equation. In the presence of an 



electromagnetic field, the Lagrangian becomes £d = h (D^ipy^D^ip) — (^) 2 ip*ip 



. The 
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Euler-LagranEre equation is [D^ + (2£) J ip = 0. The low -energy approximation to this 
equation may be found in a standard way^-. Setting Aq = <ft/c and defining the differential 
operator tt by n = —iftV, the result is Schrodinger's equation 



dip 



i>. (14) 



The differential operator on the right hand side of this equation takes the form H(tt, x) where 
H(p, x) is the classical Hamiltonian for a particle in an electromagnetic fielcr^. We thus 
establish from first principles that there is a strikingly simple map from classical mechanics 
to the theory of functions ip in the low-energy approximation. 

The simplicity of this map is no coincidence. The form p4|) of Schrodinger's equation is 
a direct consequence of defining ip's momentum by Noether's theorem, of the requirement 
for the free Lagrangian to be bilinear if properties are to be like those of a free particle 
and of the minimal coupling of ip to the electromagnetic field that results from treating the 
electromagnetic force as the curvature of a space. That the classical Hamiltonian has the 
same form follows from a theorem in classical mechanics^^, that the Hamiltonian for a 
broad class of forces that depend on position and/or velocity but not acceleration can be 
put in the minimal coupling form H(p,x). A simple map between classical and quantum 
mechanics is therefore built in to the construction of both theories. The classical result was 
discussed in comments on Feynman's proof of Maxwell's equations 15 . 

Having found Schrodinger's equation we can work back to a free Lagrangian for it and 
then calculate the symbol (if)\0\if)) and thus ip's Noether momentum. We find that (ip\0\ip) 
is just J d 3 xip*Oip. (ip\ip) can now be used to normalize any ip and to define a scalar product 
for any two functions. So, in the low energy approximation all square-integrable functions 
ip in the standard form are normalized functions, or rays, in Hilbert space 16 and can 
be treated by the theory of functions and linear operators on such a space, ip has Noether 
3-momentum (ip\jr\ip) and angular momentum (ip\x x where these are now the familiar 
quantum mechanical expressions. Its position is [ip\x\ip). The position and momentum 
operators satisfy [x\ 7r- ? ] = ihS^ . By seeking the dynamics of a particle in a space for which 
the electromagnetic field is its curvature we are therefore led to create a representation of 
a particle by a ray in Hilbert space and to infer the usual commutation relations for the 
position and momentum operators. 

To complete the proof we consider how ?/>'s properties might be observed. When if) inter- 
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acts with an electromagnetic field it can transfer momentum to the field. Since momentum 
is conserved, changes in ^'s momentum can in principle be observed by measuring changes 
to the momentum of the electromagnetic field. The same remarks apply to ^'s angular 
momentum. Since ip is assumed to be localized, [ip\x\ip) will be the approximate location of 
the charge q. This can also be observed through the electromagnetic field it gives rise to. So 
we expect momentum, angular momentum and position to be observables and to be given 
by the expression (ifj\0\ip) where O is the relevant self-adjoint operator in Hilbert space. 

A calculation of the scattering of an incoming wave packet from a potentiate shows, how- 
ever, that we cannot always assume that ip is localized. The outgoing wave is a shell whose 
radius can, in principle, grow indefinitely and so assume laboratory dimensions, making the 
internal structure of a wave packet available for observation. 

Such an observation might be made by seeking to detect t/>'s charge q using the sort 
of equipment whose use led to the definition of charge and the formulation of the laws of 
electromagnetism. Suppose such a detector has a cavity of dimensions 5y for capturing 
charge and that ip is localized within some region 5x. Provided 5y is large enough entirely 
to contain 5x, we may assume that the detector unambiguously registers the charge q and 
thus registers the detection of ip. For convenience, we may also assume that using the same 
equipment we can arrange to measure ip's momentum and its mass. 

By the early 1900s it was established that the charge associated with matter came in 
discrete units of a minimum charge ±e and that at a small scale there were particles, such 
as the electron, that carried just this charge and had a characteristic rest mass. Suppose 
ip represents one of these particles and is scattered so that it becomes an extended wave 
front whose breadth is much greater than the width 5y of a detector's cavity. Furthermore, 
suppose that detectors can be arranged, and the wave front is sufficiently extensive, that it 
encounters several detectors simultaneously. Since charge does not exist in an amount less 
than e, a particular detector must either register the whole of the charge e or no charge at all. 
If a detector registers the charge e it will register the detection of t/>; otherwise it will register 
nothing. Moreover, if one detector registers the charge e, then since charge is conserved, no 
other detector can register it. So even if the function i/j simultaneously encounters a number 
of detectors, at most one detector can register a detection. But all detectors will potentially 
respond to ^'s charge in the same way. So to produce the correct result from one detector 
and to avoid multiple detections, ip must collapse instantaneously into a new normalized 
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function that is only non-zero within the dimensions 5y of the detector that registers ^'s 
charge. The detector will then register the charge of this new function. 

This conclusion is clearly the consequence of representing a particle that has a property 
that is both conserved and discrete by a spatially extensive normalized function. We have 
framed the above argument in terms of charge, but it could equally well have been framed 
in terms of rest mass or any other observable property we might subsequently discover (such 
as spin) that is both discrete and conserved. 

To see what happens to r/>'s momentum when it is detected and collapses, consider ip 
in the form of a shell moving outwards from the origin. Such a ip can De expressed as 
the standard linear combination (Q. Suppose that we cover a sphere centred at the origin 
with N contiguous charge and momentum detectors numbered from n = 1, . . . , N. Let the 
nth detector subtend a small solid angle AQ(n) at the origin. The only components of the 
combination that will reach the nth detector are those whose momentum vectors k lie within 
AQ(n). The amount of f/>'s Noether momentum, P M (n), attributable to these components 
is, from ( H~3|) 

P»(n)= / dPk\a{k)\ 2 hk^ « \a{n)\ 2 hk^{n) (15) 

JkeAQ(n) 

where |a(n)| 2 = Jg eA ^( n ) d 3 k\a(k)\ 2 and it has been assumed that AQ(n) is so small that k^ 
is the same for all k lying within AQ(n) and is denoted /c M (n). If the nth detector registers 
a detection, the momentum it will measure will correspond to the new normalized function 
into which ip collapses and will therefore be just hk^{n). It is clear from (fl5|) that ^'s 
Noether momentum P^ is the sum of all the P M (n). 

Suppose that the momentum hk^{n) is transferred to the particle detector as part of the 
detection process. Then the balance of the Noether momentum of the scattered particle 
AP M (n) = P M — %k^(n) must be transferred to the source of the scattering potential so 
that momentum is conserved. Now suppose this experiment is repeated many times. The 
rate of transfer of momentum to the source will in principle be observable as a force on 
the source that is additional to the Lorentz force that arises from the scattering itself. But 
the laws of electromagnetism do not include such a 'detection force', so there is a potential 
inconsistency. We can reconcile the existence of this force with the failure to detect it in 
the following way. The experiments that underpin the laws of electromagnetism typically 
involve large numbers of charged particles. The fields, currents and charges in these laws 
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are averages over these particles. These particles will scatter off one another, so that charge 
detectors may be presented with the kinds of linear combination we have been considering. 
The detection force would, however, escape detection if it averaged out to zero over the 
large number of particles involved in the experiment or over many repetitions of the same 
experiment. 

For AP M (n) to average out to zero the relative frequency of detection by the nth detector 
would have to be |a(n)| 2 . Since, in principle, particle detectors can be made as small as 
we like, we may conclude from this argument that the relative frequency of detection of the 
component a(k)e~ tkx in ip must be |a(fc)| 2 . 

Although we have examined a if) created by scattering, there is nothing special about it 
other than our ability to detect individual components because they have been separated in 
space. We infer that detection applied to any linear combination of plane waves, where it 
is physically possible to detect individual components, operates according to a probabilistic 
model, where the probability of detection, Pr(fc), is |a(fc)| 2 in order to produce the relative 
frequency of detection required for consistency. 

According to this probability model, the Noether momentum of a linear combination of 
plane waves must now be interpreted as the average of the observations over many repetitions 
of the same experiment. We therefore have from ( fT2|) and ( [131) 



(if)\ihd^\ip) is thus a statistical expectation value for the observations of the eigenvalues of 
the operator ihd^ . 

Using the machinery of function theory in the low energy approximation, this argument 
can be extended to the eigenvalues of other self-adjoint operators in situations where the 
corresponding eigenfunctions are physically separated and so can be individually detected. 
In this way we obtain the general form of the quantum mechanical measurement postulate. 

Finally, we consider the consequences of starting from a Lagrangian for a if> that has 
multiple components. We have already noted that such a Lagrangian will have to be bilinear, 
so observables will still be calculated from bilinear expressions {ip\0\if>). This bilinearity 
means that the symmetry group of the theory is the covering group of the Lorentz group 
rather than the Lorentz group itself. The significance of this is that if> may belong to spinor 
as well as tensor representations. 




(16) 
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The new consequence of replacing the scalar function if) by a many-component function is 
that a term of the form (if)\ihS\if>) is added to the expression (if)\xx(—iKV)\ip) for the Noether 
angular momentum, where S is a vector of spin matrices. Schrodinger's equation is the low 
energy approximate equation for the many-component if) but now includes a corresponding 
spin termi&. These spin terms can be calculated by working through possible Lagrangians 
for ips with different numbers of components. In particular, by using a 4-component spinor, 
the Dirac equation is obtained and approximated for low energies by Schrodinger's equation 
for a spin- 1 particle^. 

COMMENTARY 

The main features of the elementary quantum mechanics of a single charged particle in 
an external electromagnetic field, including its spin, can evidently be derived by seeking to 
construct a space for which the electromagnetic force acts as a curvature and that admits, 
as far as possible, the dynamics of a free classical particle expressed through a function 
if). From this point of view, Schrodinger's equation can be interpreted as the low-energy 
approximate equation for a function with particle-like properties on a space whose curvature 
is the electromagnetic field. The probabilistic interpretation of the wave function is required 
to reconcile the existence of discrete, conserved, observable particle properties with the 
representation of a particle by a spatially extended function. The structure of the resulting 
theory is in essence what is summarized by the postulates of quantum mechanics. 

To arrive at this point of view we needed to assume the special theory of relativity, 
Maxwell's equations, the uniqueness of momentum and the existence for dynamics of a least 
action problem. The structure of wave mechanics then follows largely from the intersection 
of three ideas. The first is an extension to electromagnetism of Einstein's insight that 
there is a geometric formulation of the gravitational force as curvature. The second is 
that conservation laws arise from symmetries through Noether's theorem and the third is 
that particles have some discrete, conserved, observable properties. The representation of a 
particle by a ray in Hilbert space and the appearance of a constant h in the theory can be 
traced to the need for the covariant derivative to be properly defined and for the momentum 
of a free particle to be unique. 

Since we have not needed any explicit quantum mechanical assumptions, it may appear 
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that this proof obtains Schrodinger's equation from classical theory. This appearance is 
misleading and comes about because, once we have constructed a space whose curvature 
is the electromagnetic force, we seek to formulate particle dynamics in terms of a function 
ip. Once we do this, wave mechanics is implicit and the framework of special relativity, 
curvature, symmetry and discreteness is enough to determine the details. Our quantum 
mechanical assumption is therefore that particle dynamics can be formulated in terms of a 
function on the space whose curvature is the electromagnetic force. The proposal that a 
classical particle should be represented by a wave packet is then the best that we can do 
within the resulting theory. 

It is a fundamental dynamical idea that force is curvature and one that should be ac- 
ceptable as a point of departure for formulating quantum mechanics. It is at least logically 
possible that quantum mechanics might have been found in this way in the first place. In 
formulating general relativity, Einstein was able to use Riemannian geometry, developed 
some 50 years previously. If, in the interim, mathematicians had already completed the 
work that led to the theory of fiber bundles, then the discovery of general relativity might 
have prompted a treatment of electromagnetism along the lines described here, since the 
other ingredients were in existence by around 1920^. 

In practice, general relativity stimulated mathematicians to develop the theory of fiber 
bundles, a process that would take another 35 yearsi. Wave mechanics was arrived at by a 
different route that started from another idea of Einstein's, that the electromagnetic field 
is quantized. This idea was the basis for the de Broglie relations that played a significant 
part in prompting Schrodinger to formulate his equation. The two ideas eventually came 
together in the gauge theory of force in quantum field theory. 
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